Machine Learning Potential Model Based on Ensemble Bispectrum Feature Selection and Its Applicability Analysis

نویسندگان

چکیده

With the continuous improvement of machine learning methods, building interatomic potential (MLP) based on datasets from quantum mechanics calculations has become an effective technical approach to improving accuracy classical molecular dynamics simulation. The Spectral Neighbor Analysis Potential (SNAP) is one most commonly used potentials. It uses bispectrum encode local environment each atom in lattice. hyperparameter jmax controls mapping complexity and precision between descriptor. As increases, description will more accurate, but number parameters descriptor increase dramatically, increasing computational complexity. In order reduce without losing accuracy, this paper proposes a two-level ensemble feature selection method (EFS) for descriptor, combining perturbation selector strategy. Based proposed method, subset selected original dataset dimension-reduced MLP. application validation, data Fe, Ni, Cu, Li, Mo, Si, Ge metal elements are train linear regression model SNAP predicting these metals’ atomic energies forces them evaluate performance subsets. experimental results show that, compared features qSNAP, training our EFS qSNAP than SNAP. Compared with existing when size 0.7 times that features, SSWRP strategy can achieve best terms stability, achieving average stability 0.94 across all datasets. reduced by about half, prediction 30%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Constructing response model using ensemble based on feature subset selection

In building a response model, determining the inputs to the model has been an important issue because of the complexities of the marketing problem and limitations of mental models for decision-making. It is common that the customers’ historical purchase data contains many irrelevant or redundant features thus result in bad model performance. Furthermore, single complex models based on feature s...

متن کامل

Correlation-based Feature Selection for Machine Learning

A central problem in machine learning is identifying a representative set of features from which to construct a classification model for a particular task. This thesis addresses the problem of feature selection for machine learning through a correlation based approach. The central hypothesis is that good feature sets contain features that are highly correlated with the class, yet uncorrelated w...

متن کامل

Classification of lung cancer using ensemble-based feature selection and machine learning methods.

Lung cancer is one of the leading causes of death worldwide. There are three major types of lung cancers, non-small cell lung cancer (NSCLC), small cell lung cancer (SCLC) and carcinoid. NSCLC is further classified into lung adenocarcinoma (LADC), squamous cell lung cancer (SQCLC) as well as large cell lung cancer. Many previous studies demonstrated that DNA methylation has emerged as potential...

متن کامل

Ensemble-based Feature Selection Criteria

Recursive Feature Elimination (RFE) combined with feature ranking is an effective technique for eliminating irrelevant features when the feature dimension is large, but it is difficult to distinguish between relevant and redundant features. The usual method of determining when to stop eliminating features is based on either a validation set or cross-validation techniques. In this paper, we pres...

متن کامل

An Intelligence-Based Model for Supplier Selection Integrating Data Envelopment Analysis and Support Vector Machine

The importance of supplier selection is nowadays highlighted more than ever as companies have realized that efficient supplier selection can significantly improve the performance of their supply chain. In this paper, an integrated model that applies Data Envelopment Analysis (DEA) and Support Vector Machine (SVM) is developed to select efficient suppliers based on their predicted efficiency sco...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Metals

سال: 2023

ISSN: ['2075-4701']

DOI: https://doi.org/10.3390/met13010169